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A controller receives a document (702) from a customer. The document may be 
an image generated by scanning a paper document. One or more sections (704) of the 
document that are indicated by predetermined indicia are determined. Different sections 
may include different types of information. For example, one section of a document 
might include information that identifies a person, while another section of the document 
might include confidential information about that person. The sections are transmitted 
to one or more workers (706) who perform corresponding data entry. Separating the 
sections can thus preserve the confidential nature of the document. A file is subsequently 
received from each worker. Hie file corresponds to data entry performed on the section 
of the document. The controller determines the customer associated with the file and 
transmits the file to the customer (710). Payment is then rendered, directly or indirectly, 
from the customer to the workers (714). 
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METHOD AND APPARATUS FOR PROCESSING A DOCUMENT 
CONTAINING CONFIDENTIAL INFORMATION 

The present application is a continuation-in-part application of co- 
pending U.S. Patent Application No. 09/1 12,13 1, entitled "METHOD AND 
5 APPARATUS FOR A CRYPTOGRAPHICALLY-ASSISTED COMMERCIAL 
NETWORK SYSTEM DESIGNED TO FACILITATE AND SUPPORT EXPERT- 
BASED COMMERCE", filed on July 8, 1998, which is a continuation of U.S. Patent 
Application No. 08/685,706, filed July 24, 1996, each incorporated herein by 
reference as part of the present disclosure. 

10 

FIELD OF THE INVENTION 

The present invention relates to methods and apparatus for processing 
documents, and more specifically to methods and apparatus that facilitate the 
processing of documents containing possibly confidential information. 

15 

BACKGROUND OF THE INVENTION 

Despite the prevalence of computers and computer-related technology, 
a significant amount of information processing must be performed by humans rather 
than by computers or similar devices. For example, humans continue to be required 
20 in proof reading and amending language-translated text, especially if the translation 
was performed by a computer. Similarly, humans are often required in creating 
computer-based information, such as a text file, from a paper document or a similar 
document that is not computer-based. To do so, a human must typically read printing 

1 
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(e.g. printed characters) on the paper document and manually type text corresponding 
to the printing in order to create a text file. 

Computers can create text files from paper-based documents using a 
technique known as Optical Character Recognition ("OCR"). As is known in the art, 

5 OCR requires that an electronic device examine printed characters on the paper 

document, and determine the shapes of the printed characters by detecting patterns of 
dark and light. Then, character recognition methods are used to translate the shapes 
into computer text and a corresponding text file may be created and stored. 

Unfortunately, OCR is often inaccurate. Any character may assume a 

10 large number of shapes, especially if the character is handwritten. To circumvent this 
disadvantage, some OCR systems are limited to recognizing known typefaces and 
sizes and cannot recognize handwriting at all. Other OCR systems use sophisticated 
algorithms to learn new typefaces and sizes, and may thus increase recognition 
accuracy somewhat. However, even the best OCR system is unable to make the 

1 5 common sense judgments that allow humans to recognize poorly recognizable 
characters and words from their context. Consequently, OCR remains unable to 
consistently generate accurate text files. In fact, it can take so long to proofread and 
correct a text file created with OCR that it would be just as easy to have a human 
manually create a text file by typing the text. 

20 Unfortunately, many businesses have a great need to create computer- 

based information from paper documents. For example, insurance companies and 
other businesses in the health care field must process paper documents such as claim 
requests, payment receipts, new member applications and medical test results. The 
overwhelming majority of these paper documents are processed manually which is 
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extremely expensive and time consuming. Such documents may include confidential 
information, such as the medical information of patients. Consequently, it is 
undesirable to permit people to have access to these documents. 

It would be advantageous to provide a method and apparatus that 
5 facilitates the creation of computer-based information from paper documents. 

SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method and 
apparatus that facilitates the creation of computer-based information from paper 
10 documents. 

In accordance with the present invention, a controller receives a 
document from a customer. The document may be an image generated by scanning a 
paper document. One or more sections of the document that are indicated by 
predetermined indicia are determined. Different sections may include different types 

1 5 of information. For example, one section of a document might include information 
that identifies a person, while another section of the document might include 
confidential information about that person. The sections are transmitted to one or 
more workers who perform corresponding data entry. Separating the sections can 
thus preserve the confidential nature of the document. 

20 A file is subsequently received from each worker. The file 

corresponds to data entry performed on the section of the document. The controller 
determines the customer associated with the file and transmits the file to the customer. 
Payment is then rendered, directly or indirectly, from the customer to the workers. 



3 
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BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is a schematic illustration of an apparatus for processing a 
document in accordance with the present invention. 

FIG. 2 is a schematic illustration of a controller of the apparatus of 

5 FIG. 1. 

FIG. 3 is a schematic illustration of a customer database of the 
controller of FIG. 2. 

FIG. 4 is a schematic illustration of a worker database of the controller 

of FIG. 2. 

10 FIG. 5 is a schematic illustration of a document database of the 

controller of FIG. 2. 

FIG. 6 is a schematic illustration of a section database of the controller 

of FIG. 2. 

FIG. 7 is a flow chart illustrating a method for processing a document 
1 5 in accordance with the present invention. 

FIG. 8 is a data flow diagram illustrating the transfer of information 
within the apparatus of FIG. 1 during a portion of the method of FIG. 7. 

FIGS. 9 A, 9B, 9C and 9D are images that include indicia for 
identifying sections. 

20 FIG. 10 is an exemplary image including indicia in the form of 

boundary lines that define the perimeters of two sections. 

FIGS. 1 1 A and 1 IB are schematic illustrations of sections of a 

document. 
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FIG. 12 is a flow chart illustrating an embodiment of a portion of the 

method of FIG. 7. 

FIG. 13 is a data flow diagram illustrating the transfer of information 
within the apparatus of FIG. 1 during another portion of the method of FIG. 7. 
5 FIG. 14 is a flow chart illustrating an embodiment of another portion 

of the method of FIG. 7. 

FIG. 15 is a schematic illustration of a form for entering data. 
FIG. 16 is a schematic illustration of a form database. 
FIG. 17 is a flow chart illustrating a method for permitting a section to 
1 0 be processed by a plurality of workers. 

FIG. 18 is a schematic illustration of a record of another embodiment 
of the section database of the controller of FIG. 2. 

FIG. 19 is a flow chart illustrating a method for processing a plurality 
of documents in accordance with processing criteria for the plurality. 

15 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

The present invention allows a business or other entity to automatically 
locate and communicate with one or more workers who in turn create computer-based 
information (e.g. text files or data base entries) for the business from a paper 
20 document. Thus, the business need not consume resources in locating and hiring 
appropriate workers. The business also remains unburdened by the administrative 
costs associated with hiring and retaining such workers in an employment capacity. 

In addition, the present invention further allows information derivable 
from the paper document to remain confidential even if the document is being 

5 
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processed by a worker who may be untrustworthy. Thus, even sensitive documents 
such as medical records may be processed without compromising their confidential 
information. 

The present invention likewise benefits people that can function as 
5 "workers" (described below) even without possessing specialized skills. A market is 
created for such workers, who may work at times they choose and as much as they 
choose. In particular, very small amounts of free time (e.g. minutes) may be spent 
earning pay in exchange for services rendered remotely. Thus, people can earn 
money during times such as lunch breaks or other periods of inactivity. Workers may 
1 0 furthermore maintain their anonymity, should they desire. 

Referring to FIG. 1, an apparatus 100 comprises a controller 1 10 that is 
in communication with a customer device 120 and with worker devices 130 and 135. 
The controller 1 10 may communicate with the customer device 120 and the worker 
devices 130 and 135 via an appropriate network such as the Internet. Each of the 
1 5 controller 1 10, the customer device 120, and the worker devices 130 and 135 may 
comprise computers, such as those based on the Intel® Pentium® microprocessor, 
that are adapted to communicate via the Internet (e.g. via a modem). Any number of 
customer devices and worker devices may be in communication with the controller 
110. 

20 The controller 110 facilitates communication between a customer (e.g. 

a business) using the customer device 120 and one or more workers using worker 
devices such as the worker devices 130 and 135. The communication may be 
performed while maintaining anonymity of each party, and in fact the present 
invention can eliminate the need for a party to identify the other party with which it 
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communicates. Accordingly, the parties may form a relationship that allows 
confidential information to be handled efficiently and without compromise. 

Referring to FIG. 2, the controller 1 10 comprises a processor 202, such 
as the Intel® Pentium® microprocessor. The processor 202 is in communication with 
5 a data storage device 204, such as an appropriate combination of magnetic, optical 
and/or semiconductor memory. For example, the data storage device 204 may 
comprise one or more of a ROM, RAM and hard disk drive. The processor 202 and 
the data storage device 204 may each be (i) located entirely within a single computer 
or other computing device; (ii) connected to each other by a remote communication 
10 medium, such as a serial port cable, telephone line or radio frequency transceiver; or 
(iii) a combination thereof. In one embodiment, the controller 110 may comprise one 
or more computers that are connected to a remote server computer for maintaining 
databases. 

The data storage device 204 stores a program 210 for controlling the 
1 5 processor 202 . The processor 202 performs instructions of the program 2 1 0, and 
thereby operates in accordance with the present invention, and particularly in 
accordance with the methods described in detail herein. The program 210 
furthermore includes program elements that may be necessary, such as an operating 
system and "device drivers" for allowing the processor 202 to interface with computer 
20 peripheral devices. Appropriate device drivers and other necessary program elements 
are known to those skilled in the art, and need not be described in detail herein. 

The storage device 204 also stores (i) a customer database 220, (ii) a 
worker database 230, (iii) a document database 240, and (iv) a section database 250. 
The databases 220, 230, 240 and 250 are described in detail below and depicted with 
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exemplary entries in the accompanying figures. As will be understood by those 
skilled in the art, the schematic illustrations and accompanying descriptions of the 
databases presented herein are exemplary arrangements for stored representations of 
information. A number of other arrangements may be employed besides those 
5 suggested by the tables shown. Similarly, the illustrated entries of the databases 
represent exemplary information, and those skilled in the art will understand that the 
number and content of the entries can be different from those illustrated herein. 

Referring to FIG. 3, a table 300 represents an embodiment of the 
customer database 220 (FIG. 2). The table 300 includes entries 302 and 304, each 
1 0 defining a customer that interacts with the controller 1 1 0 (FIG. 1 ) to have documents 
processed. It will be understood by those skilled in the art that the table 300 may 
include any number of entries. The table 300 also defines fields for each of the 
entries 302 and 304. The fields specify (i) a customer identifier 320 that uniquely 
identifies the customer, (ii) an electronic mail ("email") address 322 for 
1 5 communication with the customer, (iii) a customer name 324, and (iv) a financial 
account identifier 326 that indicates a financial account the customer may use to pay 
for services. 

Referring to FIG. 4, a table 400 represents an embodiment of the 
worker database 230 (FIG. 2). The table 400 includes entries 402, 404 and 406, each 
20 defining a worker that interacts with the controller 110 (FIG. 1) to process documents. 
It will be understood by those skilled in the art that the table 400 may include any 
number of entries. The table 400 also defines fields for each of the entries 402, 404 
and 406. The fields specify (i) a worker identifier 420 that uniquely identifies the 
worker, (ii) an email address 422 for communication with the worker, (iii) a worker 

8 
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name 424, (iv) a financial account identifier 426 that indicates a financial account that 
the worker may use to receive payment, (v) job skills 428 of the worker, (vi) a quality 
rating 430 of the worker, (vii) a speed rating 432 of the worker, and (viii) a fee 
charged 434 by the worker for its services. Those skilled in the art will understand 

5 that a number of other fields may specify further information regarding a worker, such 
as other types of ratings for the worker. 

Referring to FIG. 5, a table 500 represents an embodiment of the 
document database 240 (FIG. 2). The table 500 includes entries 502 and 504, each 
defining a document that a customer needs to have processed by a worker. It will be 

10 understood by those skilled in the art that the table 500 may include any number of 
entries. The table 500 also defines fields for each of the entries 502 and 504. The 
fields specify (i) a document identifier 520 that uniquely identifies the document, (ii) 
the customer 522 for which the document is to be processed, (iii) a job code 524 that 
identifies the type of processing to be performed on the document, (iv) a price 526 to 

15 pay for processing the document, (v) how to receive payment 528 for processing 

performed, (vi) a required worker rating 530, (vii) a time 532 by which the document 
must be processed, (viii) other required characteristics 534 of the worker (if any), (ix) 
whether sections of the document must be processed by different workers 536, (x) 
section identifiers 538 that uniquely identify sections of the document that are to be 

20 processed by one or more workers, and (xi) whether and when the document 
processing has been completed 540. 

Referring to FIG. 6, a table 600 represents an embodiment of the 
section database 250 (FIG. 2). The table 600 includes entries 602, 604, 606, 608 and 
610, each defining a section that is to be processed by a worker. It will be understood 
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by those skilled in the art that the table 600 may include any number of entries. The 
table 600 also defines fields for each of the entries 602, 604, 606, 608 and 610. The 
fields specify (i) a section identifier 620 that uniquely identifies the section, (ii) a 
document identifier 622 of the document from which the section originated, (iii) a 
5 worker identifier 624 of the worker to which the section was sent, (iv) when the 

section was sent to the worker 626, (v) when the section was completed by the worker 
628, and (vi) a price 630 the worker charges for processing the section. 

Referring to FIG. 7, a method 700 for processing a document includes 
receiving a document from a customer (step 702). The step 702 may comprise 
10 receiving an electronic image, such as a JPEG ("Joint Photographic Experts Group") 
file or GIF ("Graphics Interchange Format"), of a paper document. Such an image 
may be generated by a device known as a "scanner" (e.g. a flatbed scanner or drum 
scanner) and sent via electronic mail to the controller 110 (FIG. 1). Other methods 
for generating the image and sending the image to the controller 110 will be 
1 5 understood by those skilled in the art. 

Alternatively, the step 702 may comprise receiving a paper document 
and then generating an electronic document (e.g. one or more images) from the paper 
document. In such an embodiment, the controller 110 may include a scanner that 
generates an electronic document from the received paper document. 
20 The document is divided into one or more sections (step 704). In one 

embodiment, each section comprises an image that is a portion of an image of the 
document. Thus, each section represents a portion of the document. As described in 
detail below, it can be advantageous to send only a section of a document to a worker. 
For example, one section of a document might include information that identifies a 

10 
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person, while another section of the document might include confidential information 
about that person. Providing these two sections to two workers would thus keep the 
person anonymous to the worker processing the section including the confidential 
information. 

5 The sections of the document are transmitted to one or more workers 

(step 706). A section may be transmitted to a worker in a number of ways, such as via 
email or via a web site. The workers receive the sections and perform corresponding 
data entry. The term "data entry" refers to the process of writing data into computer 
memory. Such computer memory may be volatile memory (e.g. RAM) or nonvolatile 

10 memory (e.g. disk subsystems or flash memory). As used herein, data entry includes 

(i) creating text from an image of a paper document by reading printing (e.g. printed 
characters) on the image and manually typing the text corresponding to the printing, 

(ii) proofreading and amending text, (iii) translating text (e.g. from one language to 
another), and (iv) creating text from an audio file by listening to the audio and 

1 5 manually typing the text corresponding to the audio. Those skilled in the art will 

understand that there are other types of data entry that the workers may perform after 
receiving a section of a document. 

In one embodiment, it may be necessary to assure that a document is 
processed by at least two workers. However, in another embodiment it may be 

20 acceptable to send sections to one worker. For example, it may be sufficient to send 
the same sections to a worker if each section cannot be matched with the others. Such 
an embodiment is especially appropriate where a large number of documents are 
processed by a single worker. The worker could not match any one section with 
another from the same document. In addition, it can be desirable to delay the 

11 
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transmission of the sections to the worker, since this reduces the ability of the worker 

to match the sections. 

Once a worker has performed the specified data entry, the worker 
transmits back to the controller 1 10 (FIG. 1) a file that corresponds to the data entry. 
5 For example, the worker may transmit a text file that represents words that were 

printed in the section the worker received. The controller 1 1 0 receives files from each 
worker (step 708) and transmits the files to the customer (step 710). The controller 
110 (FIG. 1) receives an amount of payment from the customer (step 712), and 
transmits appropriate amounts of payment to each worker (step 714). Alternatively, 
1 0 the controller 1 1 0 may initiate payment from the customer directly to each worker. 

Referring to FIG. 8, a data flow diagram 800 illustrates the transfer of 
information among the devices of the apparatus 100 (FIG. 1). The illustrated transfer 
occurs during a first stage (i.e. steps 702, 704 and 706) of the method 700. The 
customer device 120 transmits an electronic document (e.g. an image of a paper 
15 document) to the controller 1 10, which in turn divides the electronic document into 
sections. The controller 1 10 then transmits the sections to the worker devices 130 and 
135. 

An image of a document may be divided into sections in a number of 
ways. Sections of the document may be defined by one or more predetermined 
20 indicia. The customer may include indicia in an image transmitted to the controller 
1 10. For example, the document may be partially preprinted with the indicia, or the 
customer may add indicia to an electronic image of the document that it generates 
from a paper document. Alternatively, the indicia may be added after the customer 
provides the document. For example, the image may be marked with the indicia by 

12 
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the controller 110. In one embodiment, documents may be marked with indicia in 
accordance with a predetermined process, and each customer may have one (or more) 
associated predetermined processes. For example, the controller 1 10 may add 
predetermined indicia to all documents received from a first predetermined customer. 
5 In such an embodiment, all documents received from the first predetermined customer 
would be divided into sections in accordance with the same process. This can be 
especially advantageous for businesses having a large number of documents of the 
same type (e.g. medical claim forms). 

FIGS. 9A, 9B, 9C and 9D illustrate different ways to identify sections 
1 0 with indicia. In FIG. 9A, an image 900 includes indicia 902 that are in the form of a 
demarcation line. The indicia 902 thereby define two sections 904 and 906. In such 
an embodiment, it may be predetermined, for example, that the top section 904 
contains information that identifies a person and the bottom section 906 contains 
confidential information about that person. 
1 5 In FIG. 9B, an image 920 includes indicia 922, 924 and 926, each in 

the form of a boundary line that defines the perimeters of one of three sections 928, 
930 and 932, respectively. Alternatively, more than one indicia can indicate a 
plurality of disjointed portions of the same section. For example, the indicia 922 and 
926 may indicate two disjointed portions of the same section, while the indicia 924 
20 can indicate another, different section. In such an embodiment, the indicia 922 and 
926 may both be of the same type (to denote portions of the same section) while the 
indicia 924 are of a different type. For example, the indicia 922 and 926 may both be 
of an identical color, width or other style, which is different from that of the indicia 
924. 

13 
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In FIG. 9C, an image 940 includes indicia 942 and 944. The indicia 
942 are in the form of a demarcation line that defines two sections 946 and 948. In 
such an embodiment, the indicia 944 may differentiate between the sections by 
identifying the section in which the indicia 944 is located. For example, the indicia 
5 944 may indicate that the section 948 (which contains the indicia) contains 

information that identifies a person. In one embodiment, it would then be assumed 
that the remaining section 946 contains confidential information about that person. 

In FIG. 9D, an image 960 includes indicia 962, 964 and 966. The 
indicia 962 are in the form of a demarcation line that defines two sections 968 and 
1 0 970. The section 968 includes indicia 964 in the form of a bar code, and the section 
970 includes indicia 966 that are also in the form of a bar code. In such an 
embodiment, the indicia 964 and 966 may identify the sections 968 and 970 
respectively. For example, the indicia 964 may indicate that the corresponding 
section (the section 968) contains information that identifies a person, while indicia 
1 5 966 may indicate that the corresponding section (the section 970) contains 
confidential information about that person. 

Referring to FIG. 10, an image 1000 of a document includes indicia 
1002 and 1004, each in the form of a boundary line that defines the perimeters of two 
sections 1012 and 1014, respectively. The section 1012 contains information that 
20 identifies a person (i.e. a name, social security number and address). The section 
1014 contains confidential information about that person (i.e. medical information). 

Sections of the document may alternatively be identified as 
predetermined locations within a document. For example, the top half of a document 
may be a first section and the bottom half may be a second section. In one 

14 
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embodiment, documents may be divided into sections in accordance with a 
predetermined set of locations, and each customer may have one (or more) associated 
predetermined sets of locations. For example, all documents received from a first 
customer may be divided into three identically sized sections: a bottom section, a 
5 middle section and a top section. 

Sections may be mutually exclusive so that no section overlaps another 
section. Conversely, sections may overlap so that a portion of a document is included 
in more than one section. Furthermore, a document may include one or more 
sections, but there may be portions of a document that are not included in any section. 

10 Such portions are thus not provided to workers. For example, a portion of a document 
that includes a standard letterhead logo may need no processing. In addition, as 
described above a section may include two disjointed portions of a document. 

Different sections of a document may include different types of 
information. For example, a first section may include confidential information about 

1 5 a person, while a second section includes information that identifies that person. 
Sections may also contain the same type of information, yet each section would 
include a small portion of such information. This is advantageous where there exists 
a large amount of one type of information, and it would be easier to break up the 
information into more manageable sizes. 

20 The image 1000 also includes an identifier 1020 that is in the form of a 

set of characters. Although in FIG. 10 the identifier 1020 is in the form of a set of 
characters, in other embodiments the identifier may be in another form, such as a bar 
code. The identifier 1020 identifies processing criteria (the customer's requirements 
and/or preferences for processing the document). The identifier may indicate, for 
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example, an amount of payment to be paid for processing the document, billing 
information, a job code that specifies the type of processing required, a required rating 
of the worker(s), an indication of whether sections of the document can go to a single 
worker, a time when completion by the worker is required, and required 
5 characteristics of the worker. 

The identifier 1020 on the image 1000 may indicate the document 
identifier described above with reference to FIG. 5. In such an embodiment, the 
document identifier may be used to access further information from the document 
database 240 (FIG. 2) from which processing criteria may be derived. In another 
10 embodiment, the identifier may represent the processing criteria. For example, if the 
identifier is a set of characters, the first two characters may indicate billing 
information, the second three characters may indicate the type of processing required 
and the final eight characters may indicate a time when completion by the worker is 
required. Similarly, if the identifier is a bar code that represents a sequence of digits, 
15 the various digits may represent various types of processing criteria, as would be 
apparent to one skilled in the art. 

FIGS. 1 1 A and 1 IB each illustrate a section of a document that may be 
sent to a worker. The section 1 100 (FIG. 1 1 A) corresponds to the section 1012 of 
FIG. 10, and the section 1 150 (FIG. 1 IB) corresponds to the section 1014 of FIG. 10. 
20 The section 1 1 00 includes a bar code 1 1 02 that indicates worker criteria for the 
section 1 1 00. Similarly, the section 1150 includes a bar code 1152 that indicates 
worker criteria for the section 1 1 50. As described below, the worker criteria of a 
section specify requirements and/or preferences for processing the document that are 
provided to the worker. For example, the worker criteria may indicate a time of 
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completion for the worker. In one embodiment, the worker criteria are based, at least 
in part, on the processing criteria received from the customer. In one embodiment, 
the worker criteria may simply indicate the section's corresponding document or 
customer. 

5 FIG. 12 illustrates a method 1200 for processing a document in 

accordance with an embodiment of the present invention. The method 1200 generally 
describes an embodiment of the steps 702, 704 and 706 of the method 700 (FIG. 7). 

A paper document is received from a customer (step 1202) and a 
corresponding electronic document (e.g. an image of the paper document) is 

10 generated (step 1204). In an alternate embodiment, an electronic document is 
received (e.g. via email). Processing criteria associated with the image are also 
received (step 1206). As described above, the processing criteria indicate the 
customer's requirements and/or preferences for processing the document. The 
processing criteria may comprise a job code that identifies the type of data entry to be 

1 5 performed on the document. Similarly, the processing criteria may comprise a 

requirement that a section is processed by two or more workers, so that one worker 
can "proofread" the work of another, or so that inaccurate processing performed by 
any one or more workers may be more readily detected. 

The processing criteria may be machine-readable (e.g. in the form of a 

20 bar code) to automate reading and interpreting the processing criteria. Alternatively, 
the processing criteria may be text or other indicia that is not interpreted by the 
controller 110, but that is read by human personnel who direct the operations of the 
controller 1 10 accordingly. In another embodiment, the processing criteria may be 
text or other indicia that is merely made available to be read by workers. 
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The processing criteria may be received separately from the document, 
or may be incorporated into the image of the document (e.g. as a bar code or other 
indicia on the document). The processing criteria may be received substantially 
simultaneously with the paper document. Typically, separate processing criteria 
5 would accompany each paper document. In an alternate embodiment, processing 
criteria may be received from a customer, and the processing criteria are then applied 
to all documents received from that customer. In such an embodiment, the customer 
may, for example, desire that only one type of data entry ever be performed. 

The controller 110 then determines a first section of the image that is 
10 indicated by first predetermined indicia (step 1208), and determines a second section 
of the image that is indicated by second predetermined indicia (step 1210). In one 
embodiment, the first predetermined indicia may be the second predetermined indicia. 
For example, referring again to FIG. 9A, the indicia 902 are in the form of a 
demarcation line that defines two sections 904 and 906. Thus, the same indicia 902 
1 5 indicate both sections 904 and 906. 

Based on the processing criteria, the controller 1 10 selects a first 
worker and a second worker (steps 1212 and 1214). In one embodiment, a worker is 
selected if the type of work that the worker performs corresponds to the job code 
included in the processing criteria. For example, if the job code identifies that a 
20 document must be translated from German to English, then a worker having the 
requisite job skills (i.e. German language translation) is selected. Various methods 
for selecting workers are described below. 

After the first worker and the second worker are selected, the first 
section is transmitted to the first worker (step 1216) and the second section is 
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transmitted to the second worker (step 1218). In addition, a first identifier that 
identifies the document is transmitted to the first worker (step 1220), and a second 
identifier that also identifies the document is transmitted to the second worker (step 
1222). The first identifier and the second identifier may be identical but need not be. 
5 For example, the first identifier may be "Dl" and the second identifier may be "El", 
yet each may refer to the same document (e.g. the last character may uniquely identify 
the document). In another embodiment, the first identifier and the second identifier 
may identify a customer instead of the document. Thus, the sections would still 
correspond to a customer, and each corresponding file received in response to a 

10 transmitted section would correspond to a customer. 

The worker criteria may include the above-described identifiers. 
Furthermore, the worker criteria may be transmitted and received in a manner similar 
to that described herein with regard to the processing criteria. For example, the first 
identifier and the second identifier may be transmitted separately, or may be 

15 incorporated into the sections themselves. For example, the identifiers may comprise 
a bar code, and the controller 110 could label each section with an identifier in the 
form of a bar code. Sections and identifiers may be transmitted in several ways, such 
as via email or via a site on the World Wide Web. 

Those skilled in the art will understand that the above-described step of 

20 transmitting a first section to a first worker and a second section to a second worker 
may comprise transmitting the first section to a worker in a first group of workers and 
the second section to a worker in a second group of workers. In such an embodiment, 
it may be desirable that the first group of workers are qualified to receive a first type 
of section (e.g. a section that includes information that identifies a person) and that the 
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second group of workers are qualified to receive a second type of section (e.g. a 
section that includes confidential information about that person). Accordingly, it may 
be necessary to ensure the workers in that one group (or both groups) satisfy 
minimum requirements (e.g. no criminal background). 
5 Referring to FIG. 13, a data flow diagram 1300 illustrates the transfer 

of information among the devices of the apparatus 100 (FIG. 1). The illustrated 
transfer occurs during a second stage (i.e. steps 708 and 710) of the method 700, 
which is after that illustrated by FIG. 8. The worker devices 130 and 135 each 
transmit a file to the controller 110. The controller 1 10 then transmits the files to the 

10 customer device 120. The files may be transmitted to the customer device 120 

separately, or the controller 110 may combine the files and transmit the combined file 
to the customer device 120. 

Referring to FIG. 14, a method 1400 for processing a document 
includes receiving a first file from a first worker (step 1402). The first file 

1 5 corresponds to data entry performed on a first section that was previously transmitted 
to the first worker. As described above, the first section is derived from a document 
and may be a portion of the document. For example, the first file may be a text file of 
manually typed text. In another embodiment described in more detail below, the first 
file may be data that is created by the worker via a form and stored according to 

20 various fields of the form. 

The controller 1 10 may also receive a first identifier that identifies a 
first document (from which the first section was derived). For example, the first 
identifier may be a document identifier as illustrated in field 520 (FIG. 5). The first 
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identifier may also be a section identifier as illustrated in the field 620 (FIG. 6), which 
would, in turn, indicate a document, as is apparent from the field 622 (FIG. 6). 

The first identifier may be received substantially simultaneously with 
the first file. For example, the first identifier may be received immediately before or 
5 immediately after the first file is received, thereby associating the first file with the 
first identifier. The first file may also include the first identifier, thereby allowing the 
document to be identified from the file. Other methods for identifying the first 
document from which the first section was derived will be understood by those skilled 
in the art. 

10 A second file is received from a second worker (step 1404). The 

second file corresponds to data entry performed on a second section that was 
previously transmitted to the second worker. A second identifier also identifies a 
second document from which the second section was derived. The second file and the 
step of receiving the second file are analogous to the description above of the step 

15 1402. 

The controller 110 determines whether the first document and the 
second document are equivalent (step 1406). If so, then the first section and the 
second section are derived from the same document, and accordingly the first file and 
the second file represent data entry performed in connection with the same document. 
20 Consequently, in one embodiment both the first file and the second file are destined 
for the same customer. The customer that is associated with both the first file and the 
second file is determined (step 1408). For example, referring again to FIG. 5, each 
document corresponds to a customer, as indicated by fields 520 and 522 of the 
document database 240. Thus, the customer may be determined from the document 
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database 240. The first file and the second file are transmitted to the customer (step 
1410). As described above, the files may be transmitted to the customer device 120 
separately, or the controller 1 10 may combine the files and transmit the combined file 
to the customer device 120. 
5 The workers are rated based on their work (step 1412). For example, 

the first worker is rated based on the first file and possibly based on a previous rating, 
if any. Rating a worker is described in the parent application, U.S. Patent Application 
No. 09/1 12,131, entitled "METHOD AND APPARATUS FOR A 
CRYPTOGRAPHICALLY-ASSISTED COMMERCIAL NETWORK SYSTEM 

1 0 DESIGNED TO FACILITATE AND SUPPORT EXPERT-BASED COMMERCE", 
filed on July 8, 1998. 

Although two sections and two corresponding files were described 
with reference to FIG. 14, a document may include one or more sections that are 
processed. Accordingly, in one embodiment, it may be desirable to determine the 

15 number of sections in a document, and all corresponding files are transmitted only 
after a predetermined number of files (e.g. the number of sections) have been 
received. Alternatively, each file may be transmitted to the customer when it is 
received. 

Referring to FIG. 15, a form 1500 illustrates a form that may be used 
20 to allow a worker to enter data based on the section 1 1 00 (FIG. 1 1 ) and to store that 
data according to fields of the form 1500. The form 1500 includes a form field 1502 
that permits entry of a name, a form field 1504 that permits entry of a social security 
number and a form field 1506. Forms may be sent along with a corresponding 
document from the customer to the controller 110 (FIG. 1) and in turn to the worker. 
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Forms may also be provided to a worker via a web site. For example, a worker may 
be provided with a "URL" (Uniform Resource Locator) that identifies a web site or 
other computer-based communication channel. Upon accessing the URL, the form 
may be presented to the worker. Those skilled in the art will understand that various 
5 types of forms may be readily displayed. For example, HTML forms are well known 
to those skilled in the art, as are proprietary form formats such as PDF (portable 
document format) sold by Adobe and JetForm. 

In one embodiment, the form may be accessible through a web site 
after the worker enters an access code at the web site. The access code may serve as 

10 authorization that the worker is allowed to access a form. The access code may 
additionally or alternatively uniquely identify one of a plurality of forms that are 
available via the web site. The access code may be provided to the worker in the 
worker criteria discussed above. The access code may be provided to the worker by 
other means as well, such as transmitted along with the section. 

1 5 Data that is entered via a form may be transmitted to the controller 1 1 0 

which in turn transmits the data to the customer device 120. Alternatively, the data 
entered via the form may be transmitted directly to the customer device 120 and/or a 
database specified by the customer. 

Referring to FIG. 16, a table 1600 represents a form database. The 

20 form database may be a part of the section database 250 (FIG. 2). The table 1600 
includes entries 1602, 1604, 1606, 1608 and 1610, each defining a form to enter data 
in connection with processing a section. It will be understood by those skilled in the 
art that the table 1600 may include any number of entries. The table 1600 also 
defines fields for each of the entries 1602, 1604, 1606, 1608 and 1610. The fields 
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specify (i) a section identifier 1620 that uniquely identifies the section, (ii) a form 
identifier 1622 that uniquely identifies the form, (iii) a URL 1624 that identifies a web 
site where the form may be accessed, and (iv) an access code 1626 that is required to 
access the web site and/or specify a particular form available at the web site. 
5 One URL may be specified for more than one form. For example, the 

entries 1606, 1608 and 1610 all have the same URL "www.z.com". However, at least 
three forms "F3", "F4" and "F5" are all accessible using that URL. In particular, 
different access codes distinguish among the different forms. 

As described above, payment may be rendered either directly or 

10 indirectly from customer to worker. Payment usually comprises a crediting of an 
account, such as a bank account or credit card account. However, payment may also 
be rendered via check or other means. The customer may be billed after any 
predetermined event, such as when the worker transmits the appropriate files to the 
customer. In one embodiment, the customer may be billed periodically (e.g. every 

15 month) for all fees due to workers. 

Payment due may be defined by the customer, such as when the 
customer sets forth an amount to be paid for a particular document. Payment may 
also be defined by the worker, such as when the worker charges a time-based or job- 
based fee. Alternatively, a plurality of workers may participate in a "Dutch" auction 

20 and compete for the lowest amount or rate of pay they will accept in exchange for a 
particular section or type of work. The lowest bidder would then earn the right to 
perform the specified work in exchange for the highest bid amount. Similarly, a 
plurality of workers may participate in a "reverse" auction in which bidding starts low 
and increases until all sections have been accepted by workers. Other types of 
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auctions may take place among workers to specify pay rates and terms. In addition, 
just as workers may compete in an auction for sellers, sellers may compete in an 
auction for workers. 

A section of a document may include significant amounts of 
5 information, and processing such a section may require a significant amount of time. 
Consequently, it can be advantageous to allow a plurality of workers to participate in 
processing a single section. Similarly, it can be advantageous to allow a worker to 
receive a payment upon processing a portion of a section. Accordingly, the present 
invention permits a section to be partially processed by a first worker, and a second 

10 worker can further process the section. For example, a first worker may create a first 
text file representing the first page of a document, and a second worker may create a 
second text file representing the remaining pages of the document. 

FIG. 17 illustrates a method 1700 for permitting a section to be 
processed by a plurality of workers. A section is transmitted to a worker (step 1 702), 

15 as discussed above. In response, a file is received from the worker (step 1704), as 
also discussed above. The controller 1 10 then determines whether the section has 
been completely processed (step 1706). In other words, the controller 1 10 determines 
whether there is a portion of the section on which processing was not performed. 

If there is a portion of the section on which processing was not 

20 performed, then at least that portion (possibly the entire section) is transmitted to 

another worker (step 1708). The file from the original worker may also be sent to the 
subsequent worker to allow that worker to more readily determine which portions of 
the section have been processed. Subsequently, a file is received from the subsequent 
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worker (step 1710). The steps 1706, 1708 and 1710 are repeated until the section has 
been completely processed by one or more workers. 

All received files may be combined to form a combined file (step 
1712). The combined file is in turn transmitted to the customer (step 1714). 
5 Alternatively, the files may be transmitted to the customer upon receipt. In an 

embodiment where the combined file is created, the customer need not know that the 
section has been processed by a plurality of workers. 

In order to determine whether the section has been completely 
processed, a "stop indicator" may be received with, or included in, the file. A stop 

10 indicator indicates where in the section the worker stopped processing the section, and 
therefor indicates the portion of the section that has not been processed. The stop 
indicator may be a textual description of the portion of the section on which 
processing was not performed. For example, the stop indicator may comprise the text 
"Until page 3, second paragraph". 

15 In another embodiment, a file received from a worker may include the 

portion of the section that has not been processed. Alternatively, a file received from 
a worker may include the section which the worker has marked with indicia that 
indicates a portion of the section on which processing was not performed. Such 
indicia may be considered a "stop indicator" that indicates were the worker stopped 

20 processing the section. For example, the stop indicator may comprise a boundary line 
that separates a portion that was processed from a portion that was not processed. 

Referring to FIG. 1 8, a table 1 800 represents a record of another 
embodiment of the section database 250 (FIG. 2). In the illustrated embodiment, each 
section may be processed by one or more workers, and accordingly the processing 
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performed by each worker is recorded. Accordingly, each record corresponds to a 
section as indicated by the section identifier 1 802. The record also includes a 
document identifier 1804 to identify the document from which the section originated. 
The record further includes entries 1806, 1808 and 1810, each of which defines a 
5 portion of the section that was processed by a worker. The table 1 800 also defines 
fields for each of the entries 1806, 1808 and 1810. The fields specify (i) a worker 
identifier 1820 of the worker to which the portion was sent, (ii) when the portion was 
sent to the worker 1822, (iii) when the portion was completed by the worker 1824, 
(iv) a price 1826 the worker charges for processing the portion, and (v) a percentage 

1 0 completion 1 828 of the section after the specified worker completes processing his 
respective portion. 

It can be advantageous to allow a customer to specify processing 
criteria for a group of documents, rather than processing criteria for each document. 
A customer may only be concerned with a "high level" goal, and thus want to specify 

15 high level processing criteria for the group of documents. For example, a customer 
may wish to specify that a group of one thousand documents is processed within a 
week for an average cost of under $10 per document. Accordingly, in one 
embodiment of the present invention a customer may specify processing criteria that 
apply to a plurality of documents. The controller 1 10 (FIG. 1) then apportions the 

20 documents among workers in a manner that satisfies the processing criteria. 

FIG. 19 illustrates a method 1900 for processing a plurality of 
documents in accordance with processing criteria for the plurality. The controller 110 
receives a plurality of documents from the customer (step 1902), and also receives 
processing criteria applicable to the plurality of documents (step 1904). The 
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documents and processing criteria may be received in a manner similar to that 
described above. The processing criteria may specify, for example, a number of the 
plurality of documents to process in a predetermined time period (e.g. at least five of 
these documents must be processed within a week), a required accuracy of processing 
5 (e.g. the documents must be at least 99% accurately transcribed), and a desired cost 
for processing the plurality of documents (no more than $1000 to process the 
plurality). 

A plurality of workers are selected based on the processing criteria 
(step 1906). For example, the controller 1 10 may select ten workers that are qualified 

1 0 for the task and that also have the lowest rates. Similarly, the controller 1 1 0 may 
select twenty of the fastest workers if there is not much time. In one embodiment, 
work may be apportioned among a first set of workers that perform typing, and a 
second set of workers that proofread the files generated by the first set of workers. In 
such an embodiment, the controller 1 10 may determine whether it is more economical 

15 to hire more of the first set than of the second set to achieve the same overall level of 
accuracy. 

The workers may also be selected based on the work loads of the 
workers. For example, if a particular worker has been assigned a large number of 
sections and has not processed many of those sections, the controller 1 10 may 
20 determine that the worker has a large work load and thus is unlikely to provide rapid 
service. Conversely, if a particular worker has few or no unprocessed sections, the 
controller 110 may determine that the worker has a small work load and thus is likely 
to provide rapid service. In addition, the past response rate (duration from receiving a 
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section and completing processing for the section) of a worker may be considered in 
determining his future response rate. 

The documents and/or sections of the documents are apportioned 
among the plurality of workers (step 1908) so each worker is assigned a section of a 
5 document and all sections are allocated. The sections are transmitted to the 

appropriate worker (step 1910). Transmitting sections to workers and subsequently 
receiving files from those workers has been described above. 

Although the present invention has been described with respect to a 
preferred embodiment thereof, those skilled in the art will note that various 
10 substitutions may be made to those embodiments described herein without departing 
from the spirit and scope of the present invention. 
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What is claimed is: 

1 . A method for processing a document, comprising: 
determining a first section of a document; 

5 determining a second section of the document; 

transmitting the first section of the document to a first worker to perform 
corresponding data entry; 

transmitting to the first worker a first identifier that identifies the document; 
transmitting the second section of the document to a second worker to perform 
1 0 corresponding data entry; and 

transmitting to the second worker a second identifier that identifies the 
document. 

2. The method of claim 1 , further comprising: 

1 5 providing the first worker with access to a first database to enter data 

corresponding to the first section. 

3. The method of claim 1, in which the document comprises an image. 

20 4. The method of claim 3, further comprising: 

generating the image by scanning a paper document. 

5. The method of claim 3, further comprising: 
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marking the image with at least one of the first predetermined indicia and the 
second predetermined indicia. 



6. The method of claim 3, in which the first section comprises a portion of the 
5 image. 

7. The method of claim 1, in which the first predetermined indicia is the second 
predetermined indicia. 

10 8. The method of claim 1, in which the first predetermined indicia comprises: 
an indication of a boundary between the first section of the document and the 
second section of the document. 

9. The method of claim 8, in which the first predetermined indicia comprises: 

15 an indication of one of the first section of the document and the second section 

of the document. 

10. The method of claim 1, in which the first predetermined indicia comprises a 
bar code. 

20 

1 1 . The method of claim 1 , further comprising: 

labeling the first section with an identifier that indicates a customer. 
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12. The method of claim 1 1, in which the identifier that indicates a customer 
comprises: 

a bar code. 

5 13. The method of claim 1 , further comprising: 
receiving processing criteria. 

14. The method of claim 13, in which the processing criteria indicates at least one 
of: 

10 an amount of payment to be paid, 

billing information, 
a job code, 

a required rating of the first worker, 

an indication of whether sections of the document can go to a single worker, 
15 a time of completion by the first worker, and 

a required characteristic of the first worker. 

15. The method of claim 13, in which the document includes a bar code that 
indicates the processing criteria. 



20 



16. The method of claim 13, in which the document includes a set of characters 
that indicate the processing criteria. 

1 7. The method of claim 1 , further comprising: 
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transmitting worker criteria to the first worker. 

18. The method of claim 17, in which the worker criteria indicates at least one of: 
a customer identifier, 
5 an amount of payment to be paid, 

a job code, 

billing information, and 

a time of completion by the first worker. 

10 19. The method of claim 1 7, in which the first section is labeled with a bar code 
that indicates the worker criteria. 

20. The method of claim 1 7, further comprising: 

creating the worker criteria from received processing criteria. 

15 

21 . The method of claim 17, in which the first section includes a set of characters 
that indicate the worker criteria. 

22 . The method of claim 1 , further comprising: 
20 ascertaining a size of the first section. 

23. The method of claim 22, further comprising: 
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if the size of the first section is greater than a predetermined threshold, 
creating a third section and a fourth section from the first section, the third section and 
the fourth section each having a size less than the size of the first section. 



5 24. The method of claim 23, further comprising: 

transmitting the third section of the document to a third worker to perform 
corresponding data entry; and 

transmitting the fourth section of the document to a fourth worker to perform 
corresponding data entry. 

10 

25. The method of claim 1, further comprising: 

providing the first worker with access to a form that permits entry of data for 
the first section. 



1 5 26. The method of claim 25, in which the form comprises an HTML form. 

27. The method of claim 1, further comprising: 

receiving an offer price to be paid in exchange for performance of 
corresponding data entry. 

20 

28. The method of claim 27, further comprising: 

receiving a payment identifier that specifies a financial account to pay the 
offer price. 
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29. The method of claim 1, in which the step of determining a first section of a 
document comprises: 

determining a first section of a document, the first section being indicated by 
first predetermined indicia. 

5 

30. An apparatus for processing a document, comprising: 
a processor; and 

a storage device in communication with the processor, the storage device 
storing a program for directing the processor to: 
1 0 determine a first section of a document, and 

determine a second section of the document; and 
transmit the first section of the document to a first worker to perform 
corresponding data entry; 

transmit to the first worker a first identifier that identifies the 

1 5 document; 

transmit the second section of the document to a second worker to 
perform corresponding data entry; and 

transmit to the second worker a second identifier that identifies the 

document. 



20 



31. A method for processing a document, comprising: 

receiving a job code associated with an electronic document; 

determining a first section of the electronic document that is indicated by first 

predetermined indicia; 
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determining a second section of the electronic document that is indicated by 
second predetermined indicia; 

selecting, based on the job code, a first worker from a plurality of workers; 

transmitting the first section of the electronic document to the first worker to 
5 perform corresponding data entry; 

transmitting to the first worker a first identifier that identifies the electronic 
document; 

selecting, based on the job code, a second worker from the plurality of 
workers; 

10 transmitting the second section of the electronic document to the second 

worker to perform corresponding data entry, the second worker being different from 
the first worker; and 

transmitting to the second worker a second identifier that identifies the 
electronic document. 

15 

32. The method of claim 3 1 , in which the job code specifies a type of work to be 
performed on the electronic document. 



33. The method of claim 32, in which the step of selecting a first worker 
20 comprises: 

selecting a first worker if a type of work that the first worker performs 
corresponds to the job code. 
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34. The method of claim 31, in which the step of selecting a first worker 
comprises: 

determining a bid price each of plurality of workers is willing to accept; and 
selecting a worker having a lowest bid price. 

5 

35. The method of claim 31, further comprising: 
generating the electronic document from a paper document. 



36. A method for processing a document, comprising: 
10 receiving a first file from a first worker, the first file corresponding to data 

entry performed on a first previously-transmitted section of a document; 

receiving a second file from a second worker, the second file corresponding to 
data entry performed on a second previously-transmitted section of the document; 

determining a customer associated with both the first file and the second file; 

15 and 

transmitting the first file and the second file to the customer. 



37. The method of claim 36, further comprising: 
rating the first worker based on the first file. 

20 

38. The method of claim 37, in which the step of rating comprises: 

rating the first worker based on the first file and a previous rating of the first 

worker. 



37 



WO 00/03321 PCT/US99/I5348 

39. The method of claim 36, in which the step of transmitting the first file and the 
second file to the customer comprises: 

creating a combined file that includes the first file and the second file; and 
transmitting the combined file to the customer. 

40. The method of claim 36, in which the step of transmitting the first file and the 
second file to the customer is performed after a predetermined number of files have 
been received. 



10 41 . The method of claim 36, further comprising: 
paying the first worker. 

42. An apparatus for processing a document, comprising: 
a processor; and 

15 a storage device in communication with the processor, the storage device 

storing a program for directing the processor to: 

receive a first file from a first worker, the first file corresponding to 
data entry performed on a first previously-transmitted section of a document; 

receive a second file from a second worker, the second file 
20 corresponding to data entry performed on a second previously-transmitted section of 
the document; 

determine a customer associated with both the first file and the second 

file; and 

transmit the first file and the second file to the customer. 
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43. A method for processing a document, comprising: 

receiving a first file from a first worker, the first file corresponding to data 
entry performed on a first previously-transmitted section of a first document, the first 
5 file indicating a first identifier that identifies the first document; 

receiving a second file from a second worker, the second file corresponding to 
data entry performed on a second previously-transmitted section of a second 
document, the second file indicating a second identifier that identifies the second 
document; 

10 determining whether the first document and the second document are 

equivalent; 

determining a customer associated with both the first file and the second file; 

and 

transmitting the first file and the second file to the customer. 

15 

44. A method for processing a document, comprising: 
determining a section of a document, the section being indicated by 

predetermined indicia; 

transmitting the section of the document to a worker to perform corresponding 
20 data entry; 

receiving a file from the worker, the file corresponding to data entry 
performed on the section of the document; 

determining a customer associated with the file; and 
transmitting the file to the customer. 
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45. The method of claim 44, in which the step of transmitting the section of the 
document to a worker comprises: 

transmitting the section of the document to a first worker and to a second 
5 worker; 

and in which the step of receiving a file from the worker comprises: 
receiving a first file from the first worker; and 
receiving a second file from the second worker. 

10 46. The method of claim 45, further comprising: 
comparing the first file and the second file. 

47. The method of claim 44, further comprising: 
transmitting the file to an additional worker; and 

1 5 receiving from the additional worker an indication of a quality of the file. 

48. The method of claim 44, further comprising: 
receiving the document from the customer. 

20 49. The method of claim 48, in which the document comprises an image. 

50. An apparatus for processing a document, comprising: 
a processor; and 
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a storage device in communication with the processor, the storage device 
storing a program for directing the processor to: 

determine a section of a document, the section being indicated by 
predetermined indicia; 
5 transmit the section of the document to a worker to perform 

corresponding data entry; 

receive a file from the worker, the file corresponding to data entry 
performed on the section of the document; 

determine a customer associated with the file; and 
1 0 transmit the file to the customer. 



51. A method for processing a document, comprising: 

determining a first section of a document that is indicated by first 
predetermined indicia; 
1 5 determining a second section of the document that is indicated by second 

predetermined indicia; 

transmitting the first section of the document to a first worker to perform 
corresponding data entry; 

transmitting to the first worker a first identifier that identifies the document; 
20 transmitting the second section of the document to a second worker to perform 

corresponding data entry; 

transmitting to the second worker a second identifier that identifies the 
document; 
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receiving a first file from the first worker, the first file corresponding to data 
entry performed on the first section of the document; and 

receiving a second file from the second worker, the second file corresponding 
to data entry performed on the second section of the document. 

5 

52. A method for processing a document, comprising: 

generating an electronic document from a paper document; 
receiving a job code associated with the electronic document; 
determining a first section of the electronic document that is indicated by first 
1 0 predetermined indicia; 

determining a second section of the electronic document that is indicated by 
second predetermined indicia; 

selecting, based on the job code, a first worker from a plurality of workers; 
transmitting the first section of the electronic document to the first worker to 
1 5 perform corresponding data entry; 

transmitting to the first worker a first identifier that identifies the electronic 
document; 

selecting, based on the job code, a second worker from the plurality of 
workers; 

20 transmitting the second section of the electronic document to the second 

worker to perform corresponding data entry, the second worker being different from 
the first worker; 

transmitting to the second worker a second identifier that identifies the 
electronic document; 

42 



WO 00/03321 PCT/US99/15348 

receiving a first file from the first worker, the first file indicating a third 
identifier; 

receiving a second file from the second worker, the second file indicating a 
fourth identifier; 

5 comparing the third identifier and the fourth identifier to determine whether 

both the third identifier and the fourth identifier identify the electronic document; 

determining a customer associated with both the first file and the second file; 

and 

transmitting the first file and the second file to the customer. 

10 

53. A method for processing a document, comprising: 
receiving a plurality of documents; 

receiving processing criteria for processing the plurality of documents; 
selecting a plurality of workers in accordance with the processing criteria; and 
15 apportioning the plurality of documents among the plurality of workers in 

accordance with the processing criteria. 

54. The method of claim 53, in which the criteria for processing includes at least 
one of: 

20 a number of the plurality of documents to process in a predetermined time 

period, 

a required accuracy of processing, and 
a desired cost. 
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55. The method of claim 53, in which the step of selecting a plurality of worker 
comprises: 

selecting a plurality of workers in accordance with the processing criteria and 
in accordance with at least one of work loads and response rates of the plurality of 
5 workers. 



56. The method of claim 53, further comprising: 

transmitting the sections to the plurality of workers to perform corresponding 
data entry. 

10 

57. An apparatus for processing a document, comprising: 
a processor; and 

a storage device in communication with the processor, the storage device 
storing a program for directing the processor to: 
1 5 receive a plurality of documents; 

receive processing criteria for processing the plurality of documents; 
select a plurality of workers in accordance with the processing criteria; 

and 

apportion the plurality of documents among the plurality of workers in 
20 accordance with the processing criteria. 



58. A method for processing a document, comprising: 
receiving a plurality of documents; 

receiving processing criteria for processing the plurality of documents; 
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selecting a plurality of workers in accordance with the processing criteria; 
determining for each document a section that is indicated by predetermined 
indicia; and 

apportioning the plurality of documents among the plurality of workers in 
5 accordance with the processing criteria; and 

transmitting the sections to the plurality of workers to perform corresponding 
data entry. 

59. A method for processing a document, comprising: 

10 determining a section of the document that is indicated by first predetermined 

indicia; 

transmitting the section of the document to a first worker to perform 
corresponding data entry; 

receiving a file from the first worker, the file corresponding to data entry 
1 5 performed on the section of the document, 

determining a first portion of the section on which processing was not 
performed; and 

transmitting the first portion of the section to a second worker to perform 
corresponding data entry. 

20 

60. The method of claim 59, in which the file comprises the section of the 
document. 
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61 . The method of claim 59, in which the step of determining a first portion of the 
section on which processing was not performed comprises: 

receiving a stop indicator that indicates a first portion of the section on which 
processing was not performed. 

5 

62. The method of claim 61, in which the file includes the stop indicator. 

63. The method of claim 62, in which the file includes second predetermined 
indicia indicating the first portion of the section on which processing was not 

10 performed. 

64. The method of claim 61, in which the stop indicator comprises a textual 
description of a first portion of the section on which processing was not performed. 

15 65. An apparatus for processing a document, comprising: 
a processor; and 

a storage device in communication with the processor, the storage device 
storing a program for directing the processor to: 

determine a section of the document that is indicated by first 
20 predetermined indicia; 

transmit the section of the document to a first worker to perform 
corresponding data entry; 

receive a file from the first worker, the file corresponding to data entry 
performed on the section of the document, 
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determine a first portion of the section on which processing was not 
performed; and 

transmit the first portion of the section to a second worker to perform 
corresponding data entry. 
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